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content transformer (120), and interfaces 
to data providers (130a-130n). The telco 
interface (110) may be configured to 
provide access to users over a public 
switched telephone network (102). A user 
may access the system utilizing a single 
telephone number, which may be toll-free 
to provide an easier method of accessing the 
system. As the system answers a call from a 
user, the telco interface (1 10) may be further 
configured to provide an audio interface 
for users to interact with to access and to 
retrieve user-specific audio content from 
the system. With the audio interface, users 
may vocally request an item of information. 
Automated voice recognition units of the 
telco interface (110) speech process the 
request into a speech data value, which is 
forwarded to the content transformer (120). 
The content transformer (120) returns the 
corresponding audio content related to the 
speech data value, which is then replayed 
to i 



ru « 

WO 01/47218 Al IIIllBlliOllllllIIIIlllIIIIIll 



patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, For two-letter codes and other abbreviations, refer to the "Guid- 

IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF, ance Notes on Codes and Abbreviations" appearing at the begin- 

CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). ning of each regular issue of the PCT Gazette. 

Published: 

— With international search report. 



•WO 01/47218 PCT/US00/34531 



SYSTEM FOR ON-DEMAND DELIVERY OF USER-SPECIFIC 

AUDIO CONTENT 

1. BACKGROUND OF THE INVENTION 

5 1. Field of the Invention 

The invention relates to the distribution of information content. 
More particularly, the invention relates to the assembly and delivery of 
information content in response to a user request by the use of speech recognition 
techniques, and the delivering of information content in audio format from a 
1 0 remote source using a telephone. 

2. Background 

The growth of the Internet has coincided with, the growth of the 

availability of information for users of the Internet. With the rapid proliferation 
15 of Web sites, a typical user may access information relating to a variety of subject 

matter such as education, research, news, shopping, opportunities, employment, 

etc., in virtually real time. 

To access the information from the Internet, a typical user requires 

a desktop computer with access to an Internet Service Provider ("ISP"). An ISP 
20 provides access between the users and the Internet. The ISP typically charges a 

monthly fee for this access which may become prohibitive for some users. 

Moreover, since access to the Internet is often tied to a stationary 

desktop computer, often the user's mobility is limited while accessing the Internet. 

Recently, portable electronics manufacturers have introduced many products 
25 which allow mobile access, i.e., wireless, to the Internet. However, these 

products are often a separate purchase from the desktop computer requiring 

additional expenditures. Moreover, the user may also have to purchase a wireless 

services account from a wireless services provider, which may also become cost 

prohibitive. 

30 Besides the Internet, other media exist for transmitting information 

to users. For example, radio and television are media that users often use to 
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obtain information. Radio and television stations transmit their programming 
over the electromagnetic spectrum. The range of the radio and television stations 
is typically dictated by the strength of their signals. The loss of a radio or 
television station is a viable possibility for a mobile user who may move out of 
the range of radio or television station. 

Moreover, a typical user may not customize the type of 
information received from a radio and/or television station. Radio/television 
stations are programmed by station managers, which is often dictated by sponsors 
and popularity. Accordingly, a user may be forced to wait till a radio/television 
station transmits, if ever, the relevant desired information. This waiting may be 
an inconvenience for the user. 

There are also information providers that provide users with 
relevant real-time information. For instance, many local telephone services 
provide a telephone number in which a caller may call for the local forecast, 
traffic conditions or the like. However, this type of service is limited to a specific 
geographic area and this service only offers fixed types of information. 

Some information providers even provide telephonic information 
services where a user may dial a single number to access a variety types of 
information. Once a user calls a designated number, the user is prompted by 
audio cues for navigating the telephonic information service. For a given audio 
cue, the user is typically given choices from available information services each 
corresponding to one or more keys of the keypad of the telephone. Accordingly, 
the user navigates through the telephonic information service by using the keypad 
of the telephone. 

Although this type of telephone information service does provide 
useful information, the use of the keypad limits the usefulness of these telephonic 
information services. For instance, if a user desires information that is several 
layers within the menu of the telephonic information service, the user has to listen 
through the several layers of the menu to retrieve the one desired item of 
information. Accordingly, a user may spend more time than necessary utilizing 
the telephonic information services. 
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Alternatively, a user may memorize the keypad sequence of 
navigation to retrieve an item of relevant information to overcome the wait of 
navigating a several-layered menu. However, this solution forces users to 
memorize key-press sequences on the keypad. Although this solution is 
5 reasonable for one or two items of information, it becomes more problematic as 
the items of information increases and as the layers of menu on the telephonic 
information service increases. 

Moreover, as users become increasingly mobile, users may still 
utilize a telephonic information system in order to retrieve information while 
10 driving. As a user is navigating the menu structure of the telephonic information 
system, the user may be forced to take his eyes off the road, thus, becoming a 
hazard to himself and to others. 

SUMMARY OF THE INVENTION 

15 In accordance with the principles of the present invention, a 

method of delivering audio content to a user. The method includes requesting an 
audio query by the user over a telephone, and translating the audio query into a 
database query for requesting the audio content from an audio content provider. 
The method further includes transmitting an audio response of the database query 

20 to the user over the telephone. 

One aspect of the present invention is a method of delivering audio 
content to a user. The method includes receiving a vocal request at an audio 
interface of an audio content provider from the user where the audio interface 
configured to convert said vocal request into an electronic query. The method 

25 also includes retrieving a requested audio content from the audio content provider 
by the audio interface is in response to the electronic query. The method further 
includes outputting the requested audio content by the audio interface where the 
audio interface is further configured to output the requested audio content to said 
user over a public switched telephone network. 

30 Another aspect of the present invention is a method of browsing a 

network, where the method includes receiving a vocal browsing request at an 
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audio interface of a content provider from a user. The method includes 
converting the vocal browsing request to a text browsing request by the audio 
interface, and searching the network by a wireless application interface 
configured to search the network according to the text browsing request. The 
method further includes outputting by the audio interface at least one search result 
from the wireless application interface in response to the text browsing request. 

Another aspect of the present invention is a voice portal for 
delivering on-demand audio content to a user. The voice portal includes a telco 
interface configured to interface a network, a content transformer configured to 
provide retrieval of information to the telco interface, and an audio interface 
configured to speech process a vocal request received through the telco interface 
from the user over the network to retrieve a request for audio content from the 
content transformer. 

Another aspect of the present invention is voice portal for 
delivering on-demand audio content to a user. The voice portal includes a 
wireless application interface configured to interface a wireless network and a 
content transformer configured to provide retrieval of information to the wireless 
application interface. The voice portal also includes a wireless application server 
configured to process a request received through the wireless application interface 
from a mobile user over the wireless network to retrieve a request for audio 
content from the content transformer. 

II. BRIEF DESCRIPTION OF THE DRAWINGS 

Features and advantages of the present invention will become 
apparent to those skilled in the art from the following description with reference 
to the drawings, in which: 

Fig. 1 illustrates a block diagram of an embodiment of a voice 
portal according to the principles of the present invention; 

Fig. 2 illustrates a more detailed block diagram of an embodiment 
of a telco interface of the voice portal shown in Fig. 1 ; 
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Fig. 3 illustrates a more detailed block diagram of an embodiment 
of a content transformer of the voice portal shown in Fig. 1; 

Figs. 4a-4c illustrates a flow diagram of an embodiment of an 
audio interface for the voice portal shown in Fig. 1 ; 
5 Figs. 5a-5b illustrates a flow diagram of an embodiment of a 

recognition error processing for the audio interface of the voice portal; 

Figs. 6a-6b illustrates a flow diagram of an embodiment of a 
profile submenu for the audio interface of the voice portal; 

Figs. 7a-7c illustrates a flow diagram of an embodiment of a 
10 business submenu for the audio interface of the voice portal; 

Figs. 8a-8b illustrates a flow diagram of an embodiment of an 
invalid submenu selection processing module for the audio interface of the voice > 
portal; ^ 

Fig. 9 illustrates a block diagram of an embodiment implementing 
1 5 a wireless application protocol service to the voice portal; 

Fig. 10 illustrates a block diagram of an embodiment implementing 
a business connection service to the voice portal; and 

Figs. 1 la-1 lb illustrates a flow diagram of an embodiment of an 
attendant processing service for the voice portal. 

20 

III . DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

The present invention relates to a system for on-demand user- 
specific audio content. The system may include a telco interface, a content 
transformer, and interfaces to data providers. The telco interface may be 

25 configured to provide users an access to the system over a public switched 
telephone network. A user may access the system utilizing a single telephone 
number, which may be toll-free to provide an easier method of accessing the 
system. As the system answers a call from a user, the telco interface may be 
further configured to provide an interactive audio interface for users to access and 

30 to retrieve user-specific audio content from the system. With the audio interface, 
users may vocally request an item of information. Automated voice recognition 
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units of the telco interface speech process the request into a speech data value, 
which is forwarded to the content transformer. The content transformer returns 
the corresponding audio content related to the speech data value, which is then 
replayed to users. 

5 Unlike prior art systems that require a user traverse each level of a 

menu, the inventive audio interface of the system may provide an option to hear 
the available audio content and/or may provide a second option for a user to 
directly retrieve a desired item of audio content. A user may directly request a 
desired item of information by vocalizing the item of information. The vocal 

10 request is captured by the automated speech recognition units of the telco 
interface when prompted by the audio interface of the system. The automated 
speech recognition units may be further configured to apply speech or voice 
recognition algorithms on the captured speech utterance. The speech recognition 
algorithms return an electronic value or speech data, value, which can be verified 

15 by the audio interface of the system as a valid item of information. 

The speech data value may be then passed by a local area network 
interface of the telco interface to the content transformer. Upon receipt of the 
electronic value of the speech utterance, the content transformer may be 
configured to retrieve a data value associated with the electronic value of the 

20 speech utterance. The content transformer may be further configured to packetize 
the data value, and to return packetized data value to the telco interface. 

. Upon receipt of the packetized data value, the telco interface may 
be further configured to apply data-to-speech algorithms to transform the data 
value into speech that is heard by the user if the requested item of information is 

25 in text data format. Otherwise, the telco interface may replay the retrieved item of 
information. Accordingly, by using speech recognition techniques, a user is 
provided with the capability of navigating the system without the use of the 
keypad of the user's telephone. The safety and convenience of the user while 
driving is greatly improved. Moreover, since a user may vocally select a menu 

30 item without hearing all menu items, a user is not required to memorize keypad 
sequences to retrieve a desired item of information. 
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The system may be further configured to provide for a standby 
attendant. The standby attendant may provide for the capability of uninterrupted 
service in the event that the speech recognition algorithms is not able to convert a 
speech utterance of a user into a speech data value. The standby attendant may 
5 listen to a speech utterance of a user simultaneously as the speech recognition 
algorithms of the telco interface process the speech utterance. If the telco 
interface is unable to process the speech utterance, the standby operator may send 
the respective data value to the content transformer. Accordingly, a user may be 
provided with seemless and uninterrupted operation. 

10 The system may be further configured to include a profile of a user 

to customize the audio content for automated retrieval. A user may be provided 
the capability to set up a profile of preferred of items of information that a user 
would like to hear when using the system. The profile may be set up through an 
Internet interface of the system. Once the profile is activated by the system, a 

15 user may be given the items of information that the user desires in response to the 
user logging into the system. Accordingly, a user may be given user-specific 
audio content on demand without waiting for menu options to be played. 

The system may be yet further configured to include a capability 
for a user to retrieve a telephone of a partnered business or to be connected to the 

20 partnered business directly or to the partnered business' representative. A user 
may call into the system, and upon hearing of an interested partnered business, the 
user may vocalize the name of the partnered business and be connected through 
the system via a public switched telephone network to the partnered business or 
its representative. Alternatively, a user may be afforded the capability to hear a 

25 telephone number of a selected partnered business in response to vocalizing the 
name of the selected partnered business. Accordingly, a user may be given toll- 
free and convenient access to a number of business in partnership with the system, 
thus providing a user with a convenient and inexpensive service. 

Fig. 1 illustrates a block diagram of an exemplary embodiment of 

30 the voice portal 100 according to the principles of the present invention. In , 
particular, the voice portal 100 includes a telco interface 110, a content 
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transformer 120, data providers 130a... 130n. The telco interface 110 may be 
configured to provide an interface between the voice portal 100 and a public 
switched telephone network ("PSTN") 102. A user may access the voice portal 
100 by dialing a single telephone number over the PSTN 102. The telephone 
5 number may be a toll-free number, thus providing cost-free access to the voice 
portal 100. Alternatively, a number of telephones numbers, each local to a given 
geographical area could also be provided to provide lowest-cost access to the 
voice portal 100. 

The telco interface 110 may be further configured to apply 
10 automated speech or voice recognition algorithms on speech utterances of a user 
along with an audio interface for a user or caller to navigate the voice portal 100. 
The speech recognition algorithms of the telco interface 110 may output a 
corresponding speech data value for the speech utterance. The audio interface of 
the voice portal 100 may be configured to verify that the speech data value is a 
15 valid selection for the voice portal 100. The telco interface may be further 
configured to packetize the speech data value, and to send the packetized data 
value to the content transformer 120. 

The content transformer 120 may be configured to retrieve a 
corresponding audio content in response to the receipt of the data value from the 
20 telco interface 110. The content transformer 120 may be further configured to 
packetize the corresponding audio content for delivery to the telco interface 110. 
The telco interface 110 may be further configured to transmit the received audio 
content from the content transformer 120 to a user. 

The content transformer 120 may be further configured to interface 
25 with data providers 130a... 130n. The interface may include a Tl interface 
between the content transformer 120 and the respective data providers 
130a... 130n. However, other interfaces such as X.25, digital subscriber line, 
Internet, local area network, etc., are also contemplated by the present invention. 

The data providers 130a...l30n may be configured to provide 
30 content to the content transformer 120. The content may be audio, digital audio 
or text data. Moreover, the data providers may a variety of providers providing 
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information from Financial, weather, sports, television, news, etc. For example, 
data provider 130 may be configured to provide traffic updates for a given 
geographic area. 

Fig. 2 illustrates a more detailed block diagram of an embodiment 
5 of the telco interface 1 10 shown in Fig. 1. As shown in Fig. 2, the telco interface 
110 may include a Tl interface 210, a voice recognition unit ("VRU") 220, a 
controller 230, a memory 240, and a content transformer ("CT") interface 250. 
The Tl interface 210 may be configured to interface with the PSTN 102 of Fig. 1 
through a Tl line. The Tl interface 210 may provide simultaneous access for up 

1 0 to twenty-four different users. 

The Tl interface 210 may be configured to interface with the VRU 
220. The VRU 220 may be configured to record speech utterances of a user of 
the voice portable 100 and to apply automated speech or voice recognition 
algorithms on the recorded speech utterances of a user of the voice portal 100. 

15 The VRU 220 may be further configured to transfer the output of the speech 
recognition algorithms on the recorded speech utterances to the controller 230. 

The VRU 220 may be further configured to receive requested 
audio content from the content transformer 120 and transmit the received audio 
content to a caller. The audio content may include pre-recorded audio content or 

20 the audio content may include textual data. In the event of receiving textual data, 
the VRU 220 may be further configured to apply speech synthesis algorithms on 
the received textual data to convert the textual data to audio data for playback to 
the caller. 

The VRU 220 may be further configured to interface with the 
25 controller 230. The controller 230 may be configured to provide an execution 
platform for an application software that provides the functionality of the VRU 
220 and for the audio interface. The controller 230 may be a microprocessor, a 
micro-controller, a digital signal processor or the like. 

The controller 230 may be further configured to packetize the 
30 output of the speech recognition algorithms the content transformer through the 
CT interface 250. The controller 230 may be further configured to receive 
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content from the content transformer 120 and de-packetize the received content 
for the VRU 220. 

The controller 230 may also be configured to be interfaced with a 
memory 240. The memory 240 may be configured to store the application 
5 software for execution by the controller 230. The controller 230 may be further 
configured to be interfaced with the content transformer interface 250. The 
content transformer interface 250 may be configured to transmit and receive 
message packets between the content transformer 120 and the telco interface 1 10. 

Fig. 3 illustrates a more detailed block diagram of an exemplary 
10 embodiment of a content transformer 120 of the voice portal shown in Fig. 1 . In 
particular, the content transformer 120 includes a CT/telco interface 302 
configured to provide an interface for the exchange of data packets between the 
content transformer 120 and the telco interface 1 10. The CT/telco interface 302 
may be further configured to interface with a CT controller 304. The CT 
15 controller 304 may be configured to provide an execution platform for an 
application software that provides the functionality of the content transformer 
120. The CT controller 304 may be implemented with a microprocessor, a micro- 
controller,^ digital signal processor or the like. 

The CT controller 304 may be also configured to interface with a 
database 306. The database 306 provides storage and retrieval of items of 
information for the voice porta! 100. The database 306 may be implemented 
using a RAID-array of disks, a single large disk or the like. Some items of 
information stored on the database 306 may be periodically updated through a 
content provider interface 308, which may also be configured to interface with the 
25 CT controller 304. Data content providers may send updates to the content 
transformer 120 where the CT controller 304 would use the updates to refresh the 
content in the database 306. Alternatively, the data providers may by-pass the CT 
controller 304. Instead, data providers may send their updates to database directly 
through the content provider interface 308. 
30 The content transformer 120 may further comprise a subscriber 

profile database 312, which may be configured to give an automated delivery of 
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pre-selected information whenever a subscriber calls into the voice portal 100. 
The subscriber profile may be created by accessing the voice portal 100 through 
an Internet interface 320. A computer-based user may then create a subscriber 
profile through a web site of the voice portal 100, which is then stored in the 
5 subscriber profile database 312. When a user calls in and activates the user's 
subscriber profile, items of information requested during the creation the 
subscriber profile are delivered to the user. 

Alternatively, the subscriber profile of a user may be created 
passively by tracking the usage of the user through the voice portal 100. The 
10 controller 230 may maintain a database of the types of information requested by 
the usage along with the frequency. From the pattern of usage, a subscriber 
profile for the user may be created indexed by the automatic number identification 
and/or caller identification information of the user. 

The subscriber profile database 312 may also be utilized by an 
15 advertisement engine 314. The advertisement engine 314 may be configured to 
provide relevant advertisements of businesses partnered with the voice portal 
according the subscriber profile. In this manner, a user is not inundated with 
irrelevant and/or unwanted advertisements. 

Figs. 4a-4c illustrate an exemplary embodiment of a flow diagram 
20 400 of a main menu of an audio interface of the voice portal 1 00 according to the 
principles of the present invention. In particular, with reference to Fig. 4a, the 
controller 230 of the voice portal 100 may be configured to provide an audio 
interface to prompt a user or caller with a greeting message of inquiring what type 
of information the user is seeking or if the user would like to hear a list of 
25 available options of types of information that is available, in step 402. However, 
in the event that a user has a pre-determined type of information such as a 
business submenu, a sports submenu, a traffic submenu, a weather submenu, a 
news submenu, an entertainment submenu, a profile submenu, a flight 
information submenu, etc., a user may be taken to the selected type of information 
30 in response to a vocalization of the selected type, in step 422, as shown in Fig. 4b. 
Otherwise, the audio interface of the voice portal 100 is then directed to step 404, 
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which is an entry point for the audio interface to return after providing 
information content to a user. Besides step 404, step 406 is another return entry 
point for the audio interface to return. In step 406, a user is prompted with an 
audio cue informing the user that the audio interface of the voice portal 100 has 
5 returned to the beginning. 

In step 407, the audio interface of the voice portal 100 waits for a 
period of time, e.g., three seconds, for the user to respond with a speech utterance 
or vocal input. If the period of time has elapsed, the controller 230 directs the 
audio interface of the voice portal 100 to prompt the user to state a choice of types 

10 of information or if the user would like to hear a list of available choices of types 
of information, in step 408. 

In step 410, the voice portal 100 waits for another period of time, 
e.g., three seconds, for the user to respond with a speech utterance or audio input. 
If the period of time has elapsed, the controller 230 directs the audio interface of 

15 the voice portal 100 to prompt the user to state a choice of type of information or 
if the user would like to hear a list of available choices of types of information, in 
step 412. 

In step 414, the audio interface of the voice portal 1 00 waits for yet 
another period of time, e.g., three seconds, for the user to respond with a speech 

20 utterance. If the user fails to responds within the period of time, the audio 
interface of the voice portal 100 is directed by the controller 230 to proceed to a 
recognition error processing as shown in Fig. 5, which is described in detail 
below. Otherwise, if the audio interface of the voice portal 100 detects a response 
from a user, in step 414, step 410 or step 407, the VRU 220 of the voice portal 

25 100 determines whether if the user had uttered a profile keyword, e.g., "profile", 
in step 418. 

If the detected speech response from the user is the profile 
keyword, the audio interface is directed by the controller 230 to a profile submenu 
of the audio interface of Fig. 6, which is discussed in detail below. 
30 ° n ^e other hand, if the detected speech response is determined 

not to be the profile keyword, but a valid submenu selection, the audio interface 
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may be directed by the controller 230 to a submenu processing in step 424, as 
shown in Fig. 4b. The audio interface of the voice portal may be organized by 
categories of information. These categories may include business, sports, traffic, 
weather, news, entertainment, horoscope, profile, and flight information. 
5 Else if the detected speech is not a valid submenu selection, but a 

request for a listing of available choices of the voice portal 100, the audio 
interface may be directed by the controller 230 to step 426. In step 426, the audio 
interface may be directed by the controller 230 to play an audio cue for the user 
listing the available choices of information maintained by the voice portal 100, in 

1 0 step 428. The audio interface of the voice portal 1 00 then proceeds to step 404 of 
Fig. 4a to wait for a response from the user, in step 430. 

Else if the detected speech processed by the VRU 220 is a request 
for assistance, e.g., "HELP", in step 432, the audio interface may be directed by 
the controller 230 to play an audio cue for the user explaining the location of the 

15 user in the audio interface and/or the possible options available for the user, in 
step 434. The audio interface of the voice portal 100 then proceeds to step 404 of 
Fig. 4a to wait for a response from the user, in step 430. 

Else if the detected speech processed by the VRU 220 is a request 
to record a comment for the service provider of the voice portal 100, e.g., 

20 "COMMENT", in step 436, the audio interface may be directed by the controller 
230 to prompt a pre-recorded message to instruct a user to leave a message, in 
step 438. When the user completes the message, the audio interface prompts the 
user with an audio cue thanking the user for leaving a message, in step 440. The 
audio interface is then directed to step 406 of Fig. 4a to wait for a response from 

25 the user. 

Else if the detected speech processed by VRU 220 is a request a 
repeat of an audio prompt, e.g., "REPEAT", from either step 407 or 410 or 414, in 
step 444, the audio interface may be directed by the controller 230 to proceed to 
step 406 of Fig. 4a, in step 446. 
30 Else if the detected speech processed by VRU 220 is a request to 

restart the audio interface, e.g., "GO TO BEGINNING", from either step 407 or 
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410 or 414, in step 448, the audio interface may be directed by the controller 230 
to go to step 406 of Fig. 4a, in step 446. 

Else, with reference to Fig. 4c, if the detected speech processed by 
VRU 220 is a request to logout or exit the voice portal 100, e.g., "GOODBYE", 
from either step 407 or 410 or 414, in step 450, the audio interface may be' 
directed by the controller 230 to prompt the user with a logout or goodbye 
message, in step 454. The audio interface then terminates the call from the user, 
in step 456. 

Else if the detected speech processed by VRU 220 is not within the 
vocabulary of the VRU 220, the controller 230 of the voice portal 100 directs the 
audio interface to proceed to the recognition error processing shown in Figs. 5a 
and 5b, which is described in detail below. 

Fig. 5 illustrates a flow diagram of an exemplary embodiment of 
the recognition error processing 500. The recognition error processing may be 
referenced when the audio interface cannot process a vocal input from a user. As 
shown in Fig. 5a, if the vocal input has been detected by the audio interface of the 
voice portal 100, in step 504, the audio interface is directed by the controller 230 
to prompt the user with an audio cue indicating inability to process the speech 
utterance and to request the user to repeat the speech utterance in step 506. 

If the speech utterance of the user is able to be processed by the 
VRU 220 of the voice portal 100 in step 508, the audio interface is directed by the 
controller 230 to proceed to step 404 of Fig. 4a in step 510. Otherwise, if the 
repeated speech utterance of the user is still unable to be processed by the VRU 
220 of the voice portal 100 in step 508, the controller 230 of voice portal 100 
determines whether or not a confirmation option was previously set by the user in 
step 512. 

If the confirmation option is set, the audio interface is directed by 
the controller to provide a list of utterances that may have been uttered by the 
user. The user is asked to confirm with a yes or no response after a replay of each 
utterance in step 514. The audio interface is either taken to the location of a 
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positive response, /.<?., the interface menu corresponding to the positively 
confirmed utterance, or returned to step 404 of Fig. 4a. 

If the confirmation option is not set from step 512, the audio 
interface is directed by the controller 230 to prompt the user with an audio cue 
5 explaining that it was unable to process the speech utterance and to repeat the 
speech utterance in step 516. 

If the VRU 220 is able to speech process the speech utterance, the 
audio interface is directed by the controller 230 to step 404 of Fig. 4a. Otherwise, 
the audio interface of the voice portal 100 prompts the user with an audio cue that 
10 it was unable to process the speech utterance and is returning to the beginning of 
the audio interface in step 522. The audio interface is then directed to step 406 of 
Fig. 4a, as discussed herein above. Alternatively, if the VRU 220 cannot output a 
data value for the repeated speech utterance from step 522, the audio interface of 
the voice portal 100 informs the user that it was unable to process the repeated 
15 speech utterance, and the audio interface returns the user to the beginning of the 
particular layer of menu prompt where the speech recognition problem occurred. 

If in step 504 f there was no response from the user, the controller 
230 directs the audio interface to prompt the user with an audio cue explaining 
that the voice portal 100 is unable to hear the response of the user and to please 
20 repeat the response, in step 526 with reference to Fig. 5b. 

If the VRU 220 of the voice portal 100 is able to detect a response, 
the audio interface is directed by the controller 230 to return to step 404 of Fig. 
4a, in step 530. Otherwise, the controller 230 directs the audio interface to 
prompt a user with an audio cue explaining that the voice portal 100 is unable to 
25 hear the response of the user and to please repeat the response in step 532. 

If the VRU 220 of the voice portal 100 is able to detect a response, 
the audio interface is directed by the controller 230 to return to step 404 of Fig. 
4a, in step 536. Otherwise, the controller 230 directs the audio interface to 
prompt a user with an audio cue explaining that the voice portal 100 is unable to 
30 hear the response of the user and is returning the user to the beginning, in step 
538. The audio interface is then directed to step 404 of Fig. 4a. 
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Figs. 6a and 6b illustrate a flow diagram 600 of an exemplary 
embodiment of a profile submenu processing. The audio interface reference the 
profile submenu processing when a user utters a profile keyword, e.g., 
"PROFILE", at a prompt. As shown in Fig. 6a, once the profile keyword has been 
uttered, the audio interface is directed by the controller to enter profile submenu 
processing, in step 602. The controller 230 of the voice portal determines 
whether the Automatic Number Identification f AMI") # and/or caller ID number 
is found, in step 604. The ANI may be the calling party's telephone number 
provided by the PSTN 102 or a telecommunication network. 

If the ANI # and/or caller ID number was not found, the audio 
interface of the voice portal 100 prompts the user with an audio cue that is 
configured to request an identification number such as a telephone number of a 
user, in step 606. If the user utters an identification number in step 606, the audio 
interface is directed by the controller 230 to proceed to validate the identification 
15 number in a database in step 608. Otherwise, if the ANI # and/or caller ID 
number was found in step 604, the controller 230 initiates a search of the database 
of the voice portal 100 to determine whether or not the identification number is 
valid in step 608. 

If the identification number is valid from step 610, the controller 
20 230 of the voice portal 100 accesses the profile of the user in step 614, and 
retrieves an appropriate advertisement or sponsorship message for the user in step 
616. The audio interface of the voice portal 100 may be configured to play the 
advertisement message for the user along with a lead-in prompt with the audio 
content determined from the profile of the user in step 618. 

After the audio content is finished in step 618, the audio interface 
may cue another additional advertisement message to the user in step 620. Then, 
the audio interface replays the information that the user has pre-selected in the 
subscriber profile of the user in step 622. After the pre-selected information is 
played in step 622, the audio interface of the voice portal 100 prompts the user 
that the personalized audio content of the user is finished in step 624. The audio 
interface then returns to the beginning of the start menu, step 404 of Fig. 4a. 
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Returning to step 610, if the identification number is not valid, the 
audio interface of the voice portal 100 prompts the user that the provided 
identification number does not match the user's previously given identification 
number, e.g., a registered telephone number in step 612. The audio interface 
5 prompts the user to repeat the identification number and returns to step 602. 

Returning to step 606, the audio interface of the voice portal 100 
may be configured to respond to a user response that does not include an 
identification number. 

With reference to Fig. 6b, if the detected speech response 
10 processed by the VRU 220 is a request for assistance, e.g., "HELP", from step 
606, in step 628, the audio interface may be directed by the controller 230 to play 
an audio cue for the user explaining the function of the profile submenu, in step ' 
630. The audio interface of the voice portal 100 then proceeds to step 602 of Fig. 
6a to wait for a response from the user, in step 632. 
15 Else if the detected speech response processed by the VRU 220 is a 

request to record a comment for the service provider of the voice portal 100, e.g., 
"COMMENT", from step 606, in step 436, the audio interface may be directed by 
the controller 230 to proceed to step 438 of Fig. 4, in step 636. 

Else if the VRU 220 speech processes the response from the user is 
20 a request a repeat of an audio prompt, e.g., "REPEAT", from step 606, in step 
638, the audio interface may be directed by the controller 230 to replay the last 
prompt or last item of information in step 640. 

Else if the VRU 220 speech processes the response from the user is 
a request to restart the audio interface, e.g., "GO TO BEGINNING" from step 
25 606, in step 642, the audio interface may be directed by the controller 230 to go to 
step 406 in step 644. 

Else if the detected speech response processed by the VRU 220 is a 
request to logout or exit the voice portal 100, e.g., "GOODBYE", from step 606, 
in step 646, the audio interface may be directed by the controller 230 to proceed 
30 to step 454 of Fig. 4. 
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Else if the V&U 220 is unable to process the response from the 
user, the controller 230 of the voice portal directs the audio interface to proceed to 
the recognition error processing shown in Figs. 5a and 5b, in step 648. 

Figs. 7a-7c illustrate an exemplary embodiment of a flow diagram 
700 for a business submenu of the audio interface of the voice portal 100, 
according to the principles of the present invention. In particular, in step 702, the 
audio interface of the voice portal 100 prompts the user to vocalize a choice of 
audio content from at least one category of business in response to a user 
vocalizing a speech utterance requesting the business submenu from step 422 of 
Fig. 4b. The categories of business may include stock information, business 
news, financial markets, industry reports, reports of gainers or losers, etc. 

With reference to Fig. 7a, in step 704, the audio interface of the 
voice portal 100 enters a wait state, awaiting a response from the user. If the 
VRU 220 of the voice portal detects a speech utterance from the user, VRU 220 
applies its speech recognition algorithms on the speech utterance and a speech 
data value is outputted. The speech data value is compared against the list of 
categories, or submenus, of information in the business category, in step 706. 

If the user, from step 706, did not utter a valid submenu selection, 
but enters an invalid submenu selection processing module of the audio interface, 
in step 708, the controller 230 directs the audio interface to the invalid submenu 
processing module, which is illustrated in Figs. 8a and 8b. The invalid submenu 
selection processing module may be called by other categories of submenus to 
process an invalid or non-existent menu selection. 

If the VRU 220 of the voice portal 100, from step 706, speech 
processes an audible request from a user to hear stock quote of a company, e.g., 
"TICKER", or the company's name, in step 710, the VRU 220 speech processes 
the request and the controller 230 verifies whether or not the audible request is a 
valid request, in step 712. 

If, from step 712, the controller 230 determines that the audible 
request for the company name is invalid, the controller 230 directs the audio 
interface of the voice portal 100 to the recognition error processing module shown 



18 



'WO 01/47218 



PCT/US00/34531 



in Figs. 5a and 5b, as described herein above. Otherwise, the controller 230 
determines that the audible request for the stock quote of the requested company 
is valid, the controller 230 retrieves an appropriate advertisement message and 
stock quote from the content transformer 250, in step 716. 
5 The audio interface of the voice portal 100 replays the retrieved ' 

advertisement message for the user followed by the retrieved stock quote, in step 
718. With reference to Fig. 7c, the controller 230 directs the audio interface of 
the voice portal 100 to prompt the user if the user would like another quote with a 
yes or no response in step 720. 

10 If, in step 722, the VRU 220 of the voice portal 100 speech 

processes a positive response, e.g., a "YES", the controller 230 of the voice portal 
100 directs the audio interface to prompt the user with an audio cue requesting the 
user to vocalize another company, in step 724, and the audio interface is then 
directed to step 716 of Fig. 7a. Otherwise, if the VRU 220 of the voice portal 100 

15 speech processes a negative response, e.g., a "NO", from step 722, the controller 
230 of the voice portal 100 directs the audio interface to prompt the user to 
vocalize either another business choice, list of selections, or to restart the audio 
interface from the beginning, in step 726. The controller 230 then directs the 
audio interface to return to step 704 to wait for a user vocal response. 

20 Returning to Fig. 7a, if the VRU 220 of the voice portal 100, from 

step 706, speech processes an audible request from a user to hear the audio 
content related to business news, in step 728, the controller 230 retrieves an 
appropriate advertisement message and the requested audio content from the 
content transformer 250, in step 730. The audio interface of the voice portal 100 

25 replays the retrieved advertisement message for the user followed by the retrieved 
audio content, in step 740. The controller 230 of the voice portal 100 directs the 
audio interface to prompt the user to vocalize either another business choice, list 
of selections, or to restart the audio interface from the beginning, in step 726. The 
controller 230 then directs the audio interface to return to step 704 to wait for a 

30 user vocal response. 
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If the VRU 220 of the voice portal 100, from step 706, speech 
processes an audible request from a user to hear the audio content related to 
financial markets, in step 742, the controller 230 retrieves an appropriate 
advertisement message and the requested audio content from the content 
5 transformer 250, in step 744. The audio interface of the voice portal 100 replays 
the retrieved advertisement message for the user followed by the retrieved audio 
content, in step 746. The controller 230 of the voice portal 100 directs the audio 
interface to prompt the user to vocalize either another business choice, list of 
selections, or to restart the audio interface from the beginning, in step 726 of Fig. 
10 7c. The controller 230 then directs the audio interface to return to step 704 of Fig. 
7a to wait for a user vocal response. 

If the VRU 220 of the voice portal 100, from step 706, speech 
processes an audible request from a user to hear the audio content related to 
gainers and losers, in step 748, the controller 230 retrieves an appropriate 
15 advertisement message and the requested audio content from the content 
transformer 250, in step 750. The audio interface of the voice portal 100 replays 
the retrieved advertisement message for the user followed by the retrieved audio 
content, in step 752. The controller 230 of the voice portal 100 directs the audio 
interface to prompt the user to vocalize either another business choice, list of 
20 selections, or to restart the audio interface from the beginning, in step 726 of Fig. 
7c. The controller 230 then directs the audio interface to return to step 704 of Fig. 
7a to wait for a user vocal response. 

If the VRU 220 of the voice portal 100, from step 706, speech 
processes an audible request from a user to hear the audio content related to initial 
25 public offerings ("IPO") of companies, in step 754, the controller 230 retrieves an 
appropriate advertisement message and the requested audio content from the 
content transformer 250, in step 756. The audio interface of the voice portal 100 
replays the retrieved advertisement message for the user followed by the retrieved 
audio content, in step 758. The controller 230 of the voice portal 100 directs the 
30 audio interface to prompt the user to vocalize either another business choice, list 
of selections, or to restart the audio interface from the beginning, in step 726 of 



20 



AVO 01/47218 



PCTAJSOO/34531 



Fig. 7c. The controller 230 then directs the audio interface to return to step 704 of 
Fig. 7a, to wait for a user vocal response. 

With reference to Fig. 7b, if the VRU 220 of the voice portal 100 
speech processes a request for reports about an industry maintained by the voice 
5 portal 100, in step 760, the controller 230 verifies that the request is a valid 
request, in step 762. If industry report name is valid from step 762, the controller 
230 retrieves an appropriate advertisement message and the requested audio 
content from the content transformer 250, in step 764. The audio interface of the 
voice portal 100 replays the retrieved advertisement message for the user 
10 followed by the retrieved audio content, in step 766. The controller 230 of the 
voice portal 100 directs the audio interface to prompt the user to vocalize either 
another industry choice, another business choice, or to restart the audio interface 
from the beginning, in step 768. 

If the VRU 220 of the voice portal 100 a request for another 
15 industry report in step 770, the audio interface is directed by the controller 230 to 
proceed to step 764. 

If the VRU 220 of the voice portal 100 a request for a listing of 
business choices, in step 722, the audio interface is directed by the controller 230 
to step 704. 

20 If the VRU 220 of the voice portal 100 a request to go to the 

beginning of the audio interface, the audio interface is directed by the controller 
230 to go to step 404 of Fig. 4a, and as previously described herein above. 

If the VRU 220 of the voice portal cannot by the speech 
recognition techniques, process the response of the user in step 784, the controller 

25 230 directs the audio interface to proceed to recognition error processing shown in 
Fig. 5, in step 786. 

Returning to step 762, if the industry report name is not valid, the 
VRU 220 of the voice portal 100 may speech processes a request from a user 
listing of the selection in this category of information, in step 778, the controller 

30 230 of the voice portal directs the audio interface to provide an audio cue listing 
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the available selections in this category in step 780. The controller 230 then 
directs the audio interface to step 762. 

Else if the VRU 220 of the voice portal 100 speech processes a 
request from a user listing what are selections or choices available to a user, in 
5 step 782, the controller 230 of the voice portal 100 directs the audio interface to 
step 780. 

Else if the VRU 220 of the voice portal 100 speech processes a 
request from a user requesting assistance in this category, in step 784, the 
controller 230 directs the audio interface to provide an audio cue to the user 
10 explaining possible actions for the user to perform in step 788. Then, the audio 
interface is directed by the controller 230 to proceed to step 764. Otherwise, from 
step 762, the controller 230 directs the audio interface to enter the recognition 
error processing module shown in Figs. 5a and 5b. 

The audio interface of the voice portal 100 may be further 
15 configured to provide processing of a group of commands after most prompts. 
The group of commands may include 'HELP, 'WHAT ARE MY CHOICES', 
'COMMENT, *NEXT, PREVIOUS 1 , 'MAIN MENU', START OVER', 'BACK', 
'SKIP', 'REPEAT', 'GO TO BEGINNING', 'GOODBYE', etc. 

The exemplary embodiment of the flow diagram for the business 
20 submenu shown in Figs. 7a-7c may be applied to other categories of audio content 
that user may be interested in constructing an audio interface thereof. The other 
categories may include traffic, weather, sports, news, entertainment, flight 
information, etc. 

Figs. 8a and 8b illustrate a more detailed flow diagram of an 
25 embodiment of an invalid submenu selection processing as shown in Figs. 7a-7c. 
The invalid submenu selection processing may be invoked by any submenu 
within the audio interface. As shown in Fig. 8a, if the VRU 220 of the voice 
portal 100 speech processes a request for assistance, e.g., speech utterance of 
"HELP", in step 802, the audio interface of the voice portal 100 prompts the user 
30 with an audio cue informing the user of the selections with reference to the calling 
submenu category and to ask the user to vocalize another response, in step 804. 
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The audio interface is then directed to the wait state of the calling submenu, e.g., 
step 704 of the business submenu shown in Fig. 7a, in step 805. 

If the VRU 220 of the voice portal 100 speech processes an audible 
request to replay an audible list of business category selections, in step 806, the 
5 audio interface of the voice portal 100 replays the list of audible list of business 
category selection in step 808. The audio interface is then directed to return to the 
wait state of the calling submenu, in step 805. 

If the VRU 220 of the voice portal 100 speech processes an audible 
request to leave a comment, in step 810, the audio interface of the voice portal is 
1 0 directed to step 438 of Fig. 4b, in step 8 1 2. 

If the VRU 220 of the voice portal 1 00 speech processes an audible 
request to move to the next item in the audio interface, in step 834,, the audio 
interface is directed to the replay the next available audio prompt for the user in 
the calling submenu, in step 816. 
15 If the VRU 220 of the voice portal 100 speech processes an audible 

request to replay the most recently played menu item, in step 818, the audio 
interface of the voice portal 100 is directed to play the last played audio prompt or 
the most recent delivered audio content requested by the user, in step 820. 

If the VRU 220 of the voice portal 100 speech processes an audible 
20 request from a user to direct the audio interface to start at the beginning of the 
main menu, e.g., "START OVER", in step 822, the audio interface of the voice 
portal 100 is directed to the wait state of the invoking submenu, e.g., step 404 of 
Fig. 4a. 

If the VRU 220 of the voice portal 1 00 speech processes an audible 
25 request from a user to back up to a previous prompt, e.g., "PREVIOUS", in step 
824, the audio interface of the voice portal 100 backs up to the previous prompt 
and replays an audio cue informing the user of the previous prompt, in step 826. 

If the VRU 220 of the voice portal 100 speech processes an audible 
request from a user to skip to a next prompt, e.g., "NEXT", in step 828, the audio 
30 interface of the voice portal 100 is directed by the controller 230 to the next 
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prompt of the calling submenu and replays an audio cue informing the user of the 
next prompt, in step 830. 

If the VRU 220 of the voice portal 100 speech processes an audible 
request from a user to repeat the most recent prompt, e.g., "REPEAT", in step 
5 832, the audio interface of the voice portal 1 00 is directed by the controller 230 to 
the most recent prompt of the calling submenu and replays an audio cue 
informing the user of the most recent prompt, in step 834. 

If the VRU 220 of the voice portal 1 00 speech processes an audible 
request from a user to direct the audio interface to go the beginning of the main 
10 menu, e.g., "GO BEGINNING", in step 836 the audio interface of the voice portal 
100 is directed by the controller 230 to step 406 of Fig. 4a, in step 838. 

If the VRU 220 of the voice portal 100 speech processes an audible 
request from a user to log out of the voice portal 100, e.g., "GOODBYE", in step 
840, the audio interface of the voice portal 100 is directed by the controller 230 to 
15 go to step 464 of Fig. 4b to terminate the call, in step 842. 

If the VRU 220 of the voice portal 100 in unable to speech process 
an audible request from a user, the controller 230 of the voice portal 100 directs 
the audio interface to enter the recognition error processing module shown in 
Figs. 5a and 5b. 

20 Fig. 9 illustrates an embodiment of a voice portal 900 with the 

added functionality of a wireless application server interface according to the 
principles of the present invention. In particular, the voice portal 900 includes a 
telco interface 902, a content transformer 904, a wireless application server 906, a 
digital audio content database 908, and content providers 9 10a... 91 On. The telco 

25 interface 902 may be configured similarly to the telco interface 1 10 of Figs. 1 and 
2, i.e., include the components and functionality of the telco interface 110, as 
described herein above. 

The content transformer 904 may be also be configured to include 
the components and functionality of the content transformer 120 shown in Figs. 1 

30 and 3. Moreover, the content transformer 120 may include the wireless 
application server 906 may be configured to retrieve a corresponding audio 
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content from the digital audio content database 908, in response to the receipt of 
the speech data value from the telco interface 902, where the digital audio content 
database 908 may be configured to provide storage and access to audio content 
provided by the voice portal 900. The digital audio content database 908 may be 
5 further configured to interface with data providers 910a... 9 lOn. The interface 
may include a Tl interface between the content transformer 904 and the 
respective data providers 910a.. .910n. However, other interfaces such as X.25, 
digital subscriber line, Internet, local area network, etc., are also contemplated by 
the present invention. 

10 The wireless application server 906 may be further configured to 

send digital audio content to mobile users through a wireless application interface 
907 of the wireless application server 906. The WAP interface 907 may be 
configured to transmit and receive data/command(s) from wireless application 
enabled portable electronic devices. The wire application server 906 may be 

15 further configured to communicate with wireless application enabled portable 
electronic devices, utilizing a wireless application protocol, the FLEX™ protocol 
or other wireless data protocols. 

As an example, a user with a WAP-enabled mobile telephone 914 
having speech recognition functionality, may retrieve audio content through the 

20 mobile telephone 914, by accessing the voice portal 900 through the Internet 916. 
Alternatively, a user with a WAP-enabled telephone 918 without speech 
recognition functionality, along with ordinary mobile and non-mobile telephones 
920, may access the voice portal 900 through a public switch telephone network 
912 over conventional wireless methods utilizing towers 922a or 922b or wired 

25 methods, respectively. 

Fig. 10 illustrates another embodiment of a voice portal with the 
added functionality of access of partnered businesses. In particular, a voice portal 
1000. The voice portal 1000 may be configured similarly as the voice portal 100, 
shown in Figs. 1-8, as described herein above. The voice portal 1002 further 

30 includes a partnered business database 1002. The partnered business database 
1002 may be configured to provide storage and access to the names and numbers 
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of the business that have partnered with the service providers of the voice portal 
100, i.e., partnered business. 

A user accessing the voice portal 1000, may vocalize a request for 
the audio interface of the voice portal 1000 to replay a list of businesses partnered 
5 with the voice portal 1000. The user may vocalize a request to select one of the 
businesses from the list once connected with the voice portal 1000 with a 
telephone 1006 or mobile telephone 1008 through the PSTN 1004. The controller 
230 of the voice portal may be further configured to connect the user with the 
selected business 1110 or its agent, a call center computer 112, over the PSTN 

10 1004 in response to verification of the selection. Alternatively, the audio interface 
may function as a directory and provide a telephone number of the selected 
business with an audio cue through the audio interface in response to verification 
of the selection from the partnered business database 1002. Accordingly, a user 
may obtain toll-free or PSTN number access to a partnered business through the 

15 voice portal 1000. 

Figs, lla-llb illustrate a flow diagram of attendant support of 
automated speech recognition function of the audio interface of the voice portal 
according to the principles of the present invention. In particular with reference 
to Fig. 1 la, a user or caller into the voice portal 100 is prompted by the audio 

20 interface to vocalize a request, in step 1102. The VRU 220 of the voice portal 
100 speech processes the request, in step 1 104. 

If the VRU 220 of the voice portal 100 speech processes a valid 
speech data value from step 1 106, the user's request is automatically processed, in 
step 1 108 of Fig. 1 lb, as discussed herein above. Otherwise, if the VRU 220 can 

25 not speech process the request, the controller 230 directs the audio interface of the 
voice portal 100 to prompt the user with an audio cue to repeat the previous 
request, in step 1110. 

If the VRU 220 of the voice portal 100 speech processes the 
repeated request as a valid speech data value, in step 1112, the user's request is 

30 automatically processed, in step 1108 of Fig. lib, as described herein above. 
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Otherwise, the repeated request is replayed to an attendant or operator of the voice 
portal 100, in step 1114. 

With reference to Fig. 1 lb, if the attendant identifies the repeated 
request, in step 1116, the attendant notifies the audio interface of the voice portal 
5 100 the identified request, in step 1118. The notification may be conducted 
through a touch screen identifying all the possible choices of the voice portal 100, 
a message to the voice portal 100, etc., as known to those of ordinary skill in the 
art. 

Otherwise, if the attendant cannot identify the repeated request or 
10 phrase, the attendant notifies the voice portal to prompt the user to spell out the 
request, in step 1120. The user spelling out of the request is also recorded and 
sent to the attendant, in step 1 120. 

If the attendant identifies the spelled out request, in step 1 122, the 
attendant notifies the audio interface of the voice portal 100 the spelled out 
15 request, as in step 1118. Otherwise, the user's request cannot be executed and the 
call is terminated, in step II 24. 

While the invention has been described with reference to the 
exemplary embodiments thereof, those skilled in the art will be able to make 
various modifications to the described embodiments of the invention without 
20 departing from the true spirit and scope of the invention. 



27 



WO 01/47218 PCT/US00/34531 



CLAIMS 

What is claimed is: 

5 LA method of delivering audio content to a user, said 

method comprising: 

requesting an audio query by said user over a telephone; 
translating said audio query into a database query for requesting 
said audio content from an audio content provider; and 
10 transmitting an audio response of said database query to said user 

over said telephone. 

2. The method of delivering audio content to a user according 
to claim 2, further comprising: 

15 providing said user a plurality of audio queries; and 

recognizing said audio query in response to a selection of said 
audio query from said plurality of audio queries. 

3. The method of delivering audio content to a user according 
20 to claim 3, further comprising: 

retrieving said audio response in response to said database query. 

4. The method of delivering audio content to a user according 
to claim 4, further comprising: 

25 accessing said audio content provider over a public switched 

telephone network. 
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5. The method of delivering audio content to a user according 
to claim 1 , further comprising: 

providing a sponsorship message to said user over said telephone 
in response to said audio query; and 
5 requesting consumer data from said user. 

6. The method of delivering audio content to a user according 
to claim 1 , further comprising: 

providing a sponsorship message to said user over said telephone 
10 in response to said audio query; and 

providing a link to said user, said link configured to provide 
additional information of a sponsor of said sponsorship message. 

7. The method of delivering audio content to a user according 
15 to claim 1 , further comprising: 

providing a sponsorship message to said user over said telephone 
in response to said audio query; and 

providing an option for said user to access a live operator. 

20 8. The method of delivering audio content to a user according 

to claim 1 , further comprising: 

providing a plurality of menu selections, each menu selection of 
said plurality of menu selections configured to represent a link to a corresponding 
audio content; and 

25 navigating said plurality of menu selections in response to a vocal 

selection of one of said plurality of menu selections. 
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9. The method of delivering audio content to a user according 
to claim 8, further comprising: 

storing said vocal response; and 

retrieving said audio content from said audio content provider in 
5 response to said vocal response. 

10. A method of delivering audio content to a user, said 
method comprising: 

receiving a vocal request at an audio interface of an audio content 
10 provider from said user, said audio interface configured to convert said vocal 
request into an electronic query; 

retrieving a requested audio content from said audio content 
provider by said audio interface in response to said electronic query; and 

outputting said requested audio content by said audio interface, 
15 wherein said audio interface is further configured to output said requested audio 
content to said user over a public switched telephone network. 

11. A method of browsing a network, said method comprising: 
receiving a vocal browsing request at an audio interface of a 

20 content provider from a user; 

converting said vocal browsing request to a text browsing request 
by said audio interface; 

searching said network by a wireless application interface 
configured to search said network according to said text browsing request; and 
25 outputting by said audio interface at least one search result from 

said wireless application interface in response to said text browsing request. 
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12. A voice portal for delivering on-demand audio content to a 
user, said voice portal comprising: 

a telco interface configured to interface a network; 

a content transformer configured to provide retrieval of 
5 information to said telco interface; and 

an audio interface configured to speech process a vocal request 
received through said teleco interface from said user over said network to retrieve 
a request for audio content from said content transformer 



10 13. The voice portal for delivering on-demand audio content to 

a user according to claim 12, wherein said telco interface further comprising: 

a voice recognition unit configured to apply speech recognition 

techniques on said vocal request and further configured to output a corresponding 

speech data value of said vocal request; 
15 a content transformer interface configured to provide an interface 

between said content transformer and said telco interface; and 

a controller configured to packetize said corresponding speech data 

for transmission to said content transformer through said content transformer 

interface. 

20 

14. The voice portal for delivering on-demand audio content to 
a user according to claim 13, wherein said content transformer comprising: 

a database configured to provide storage and access to content 
maintained by said voice portal; and 
25 a content transformer controller configured to retrieve audio 

content from said database and to transmit said audio content to said telco 
interface in response to receiving of said corresponding speech data from said 
telco interface. 
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15. The voice portal for delivering on-demand audio content to 
a user according to claim 14, wherein said content transformer further comprising: 
a content provider interface configured to receive content provider 
information from a content provider, wherein said content transformer controller 
5 is further configured to store said received content provider information to said 
database. 



16. The voice portal for delivering on-demand audio content to 
a user according to claim 15, wherein said content transformer interfaces includes 

10 aTlline. 

17. The voice portal for delivering on-demand audio content to 
a user according to claim 13, wherein said content provider information at least 
one of a news information, a sports information, a financial information, a flight 

15 information, a weather information, and a business information. 



18. The voice portal for delivering on-demand audio content to 
a user according to claim 13, wherein said audio interface is configured to provide 
for a standby attendant to process vocal requests that said voice recognition unit 

20 cannot output said corresponding data value for said vocal request. 

19. The voice portal for delivering on-demand audio content to 
a user according to claim 1 3, further comprising: 

an advertising engine configured to provide sponsor messages to 
25 said audio interface; 

a subscriber profile configured to provide pre-selected audio 
content to said audio interface, wherein said audio interface is further configured 
to provide a relevant sponsor message according to said subscriber profile. 
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20. The voice porta] for delivering on-demand audio content to 
a user according to claim 19, wherein: 

said subscriber profile is created by tracking usage patterns of said 

user. 

5 

21. The voice portal for delivering on-demand audio content to 
a user according to claim 1 9, wherein: 

said subscriber profile is created through a web site of said voice 

portal. 

10 

22. The voice porta! for delivering on-demand audio content to 
a user according to claim 12, wherein said voice recognition unit is further 
configured to provide speech synthesis on text data received from said content 
transformer. 

15 

23. The voice portal for delivering on-demand audio content to 
a user according to claim 12, wherein said telco interface includes a Tl interface 
configured to interface said telco interface with said network. 

20 24. The voice portal for delivering on-demand audio content to 

a user according to claim 23, wherein said Tl interface includes access for 
twenty-four users. 

25. The voice portal for delivering on-demand audio content to 
25 a user according to claim 12, wherein said audio interface is configured to provide 
a connection to a partnered business in response to a request for said partnered 
business. 
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26. A voice portal for delivering on-demand audio content to a 
user, said voice portal comprising: 

a wireless application interface configured to interface a wireless 

network; 

5 a content transformer configured to provide retrieval of 

information to said wireless application interface; and 

a wireless application server configured to process a request 
received through said wireless application interface from a mobile user over said 
wireless network to retrieve a request for audio content from said content 
10 transformer. 

27. The voice portal for delivering on-demand audio content to 
a user according to claim 26, wherein said wireless application interface 
substantially conforming to a wireless data protocol. 

15 

28. The voice portal for delivering on-demand audio content to 
a user according to claim 27, wherein said wireless data protocol includes a 
wireless application protocol. 
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